1,070
1.1K
movies
eye 1,070
favorite 0
comment 0
Blip.tv changed its terms of service in November 2013and removed videos of conferences and tech talks. CarlFK's blip account had videos of tech talks in the Chicago area.This is an archive of his account
Topic: programming
954
954
movies
eye 954
favorite 0
comment 0
**Machine Learning** is about *using the right features, to build the right models, to achieve the right tasks* [[Flach, 2012]][0] However, to come up with a definition of what actually means **right** for the problem at the hand, it is required to analyse huge amounts of data, and to evaluate the performance of different algorithms on these data. However, deriving a working machine learning solution for a given problem is far from being a *waterfall* process. It is an iterative process where...
590
590
movies
eye 590
favorite 0
comment 0
Introduction =========== This talk will first introduce the audience to WebRTC and then discuss about how to implement the server side logic of a WebRTC app using Python. WebRTC is a free, open project that enables web browsers with plugin-less Real-Time Communications (RTC) capabilities via simple JavaScript APIs. What makes WebRTC special is that the data travels from one client to another without going through the server. The main functions of WebRTC can be broadly categorized into three...
588
588
movies
eye 588
favorite 0
comment 0
496
496
movies
eye 496
favorite 0
comment 0
Who am I ? ========= * a programmer * a high school student * a blogger * Pythonista * and tea lover - Creator of freepythontips.wordpress.com - I made soundcloud-dl.appspot.com - I am a main contributor of youtube-dl. - I teach programming at my school to my friends. - It's my first programming related conference. - The life of a python programmer in Pakistan What this talk is about ? ================== - What is Web Scraping and its usefulness - Which libraries are available for the job -...
430
430
movies
eye 430
favorite 0
comment 0
Python has been used in educational programmes ever since. With a bandwidth that large, navigating the landscape of Python tutorials is hard indeed. This talk will look at successful Python teaching material. From the numerous iterations our material has gone through, we draw conclusions on what's crucial in teaching Python. It will introduce how the OpenTechSchool is teaching Python and what measures it found most effective in spreading programming in general and Python in particular. Among...
379
379
movies
eye 379
favorite 0
comment 0
Kafka is a distributed, persistent, replicated publish-subscribe system which can be used for variety of use cases like activity stream processing, log collection etc. The design philosophy of Kafka is such that it enables real-time processing. Storm is a high-volume, continuous, reliable stream processing system developed at BackType and open-sourced by Twitter. Storm is being widely used in lot of organizations and has variety of uses-cases like: * Realtime analytics * Distributed RPC * ETL...
374
374
movies
eye 374
favorite 0
comment 0
Python is a great language for writing command-line tools - which is why so much of Linux is secretly written in Python these days. Unfortunately, what starts as a simple script can quickly get out of hand as more features are added and more people start using it! The talk will consist of a tour through various useful libraries and practical code showing how each can be used, and include advice on how to best structure simple and complex command-line tools. Things to consider when writing...
356
356
movies
eye 356
favorite 0
comment 0
298
298
movies
eye 298
favorite 0
comment 0
264
264
movies
eye 264
favorite 0
comment 0
The best filter algorithm to fuse multiple sensor informations is the Kalman filter. To implement it for non-linear dynamic models (e.g. a car), analytic calculations for the matrices are necessary. In this talk, one can see, how the IPython Notebook and Sympy helps to develop an optimal filter to fuse sensor information from different sources (e.g. acceleration, speed and GPS position) to get an optimal estimate. more: http://balzer82.github.io/Kalman/
Topics: python, pydata, pydataberlin2014
257
257
movies
eye 257
favorite 0
comment 0
We present the Shogun Machine Learning Toolbox, a unified framework for Machine Learning algorithms. Machine Learning (ML) is the art of finding structure in data in an automated way and has given rise to a wide range of applications such as recommendation systems, object recognition, brain-computer interfaces, robotics, predicting stock prices, etc. Our toolbox offers extensive bindings with other software and computing languages, Python being the major target. The library was initiated in...
251
251
movies
eye 251
favorite 0
comment 0
How we created a scalable mass VM hosting for open source web apps with python, LXC and linux with a web-UI based on django and angularjs. We'll show the underlying architecture of this service, several linux internals that make this possible and we'll talk about bitter failure during development. This talk will be python- and linux-centric with some hints for integrating angularjs into django.
244
244
movies
eye 244
favorite 0
comment 0
240
240
movies
eye 240
favorite 0
comment 0
*(Special note: [Pythonista][1] and [Editorial][2] are designed and created **in Berlin**).* 1. Definition and history of Pythonista 2. Constraints that Apple's App Store imposes on development environments 3. Pythonista User Experience and [Community Forum][3] 4. [iOS-specific modules][4]: contacts, Dropbox, location, notification, photo, scene, sound, speech 5. Source code demonstrations * GPS: Combine location and requests to deliver local weather in real-time * Image: Use photo and PIL to...
230
230
movies
eye 230
favorite 0
comment 0
216
216
movies
eye 216
favorite 0
comment 0
A new class of cheap consumer EEG devices allows ordinary hackers and even high school students a glimpse into the human brain. This talk will present how to use the "Neurosky Mindwave" headset with python software, and lay out the basic scientific and technical background. The Mindwave Mobile is a device that can be easily talked to using bluetooth, and it talks a binary protocol which is specifically designed to be useful without much computing power in the receiving device or...
203
203
movies
eye 203
favorite 1
comment 0
The Python DB-API 2.0 provides a direct interface to many popular database backends. It makes interaction with relational database very straight forward and allows tapping into the full set of features these databases provide. The talk will cover advanced database topics which are relevant in production environments such as locks, distributed transactions and transaction isolation. ---- The talk will give an in-depth discussion of advanced database programming topics based on the Python DB-API...
201
201
movies
eye 201
favorite 0
comment 0
200
200
movies
eye 200
favorite 0
comment 0
188
188
movies
eye 188
favorite 0
comment 0
187
187
movies
eye 187
favorite 1
comment 0
Probabilistic Programming allows flexible specification of statistical models to gain insight from data. Estimation of best fitting parameter values, as well as uncertainty in these estimations, can be automated by sampling algorithms like Markov chain Monte Carlo (MCMC). The high interpretability and flexibility of this approach has lead to a huge paradigm shift in scientific fields ranging from Cognitive Science to Data Science and Quantitative Finance. PyMC3 is a new Python module that...
181
181
movies
eye 181
favorite 0
comment 0
175
175
movies
eye 175
favorite 0
comment 0
167
167
movies
eye 167
favorite 0
comment 0
The Cython compiler is the most widely used static compiler for Python. The code it generates is used in countless critical applications that process huge amounts of data world wide. Cython has two major use cases: to compile Python code into fast native extension modules, and to connect native code to the CPython runtime. The main goal of the Cython project is to make it easy for users to manually optimise their Python code to make it run at C speed. This talk by one of the core developers...
149
149
movies
eye 149
favorite 0
comment 0
146
146
movies
eye 146
favorite 0
comment 0
146
146
movies
eye 146
favorite 0
comment 0
Experiences from building a recommendation engine for patent search using pythonic NLP and topic modeling tools such as Gensim.
Topics: python, pydata, pydataberlin2014
145
145
movies
eye 145
favorite 0
comment 0
136
136
movies
eye 136
favorite 0
comment 0
128
128
movies
eye 128
favorite 0
comment 0
124
124
movies
eye 124
favorite 0
comment 0
For data, and data science, to be the fuel of the 21th century, data driven applications should not be confined to dashboards and static analyses. Instead they should be the driver of the organizations that own or generates the data. Most of these applications are web-based and require real-time access to the data. However, many Big Data analyses and tools are inherently batch-driven and not well suited for real-time and performance-critical connections with applications. Trade-offs become...
Topics: python, pydata, pydataberlin2014
120
120
movies
eye 120
favorite 0
comment 0
Scikit-Learn (http://scikit-learn.org/) is a well-known and popular framework for machine learning that is used by Data Scientists all over the world. We show in a practical way how you can add your own estimator following the interfaces of Scikit-Learn. First we give a small introduction to the design of Scikit-Learn and its inner workings. Then we show how easily Scikit-Learn can be extended by creating an own estimator. In order to demonstrate this, we extend Scikit-Learn by the popular and...
120
120
movies
eye 120
favorite 0
comment 0
When talking of parallel processing, some task requires a substantial set-up time. This is the case of Natural Language Processing (NLP) tasks such as classification, where models need to be loaded into memory. In these situations, we can not start a new process for every data set to be handled, but the system needs to be ready to process new incoming data. This talk will look at job queue systems, with particular focus on gearman. We will see how we are using it at Synthesio for NLP tasks; how...
Topics: python, pydata, pydataberlin2014
105
105
movies
eye 105
favorite 0
comment 0
104
104
movies
eye 104
favorite 0
comment 0
This talk distills information from the open source guide [Full Stack Python](http://www.fullstackpython.com/) I wrote into a 30 minute talk on web stack layers. An approximate timeline for this talk would be: * 5 min: intro story * 5 min: what the web developers need to know about virtual servers, web servers, and WSGI servers * 5 min: what do web frameworks provide? * 5 min: what are the most important parts of your web application to analyze and monitor? * 5 min: static files and execution...
103
103
movies
eye 103
favorite 0
comment 0
101
101
movies
eye 101
favorite 0
comment 0
Can empirical samples unveil the big picture? Is chess games descriptions expose good enough data to gain understanding of chess rules - legal piece moves, castling, check versus checkmate, etc. Which features are important in describing a chess game and which features are not. What is a good representation of a chess game for this uses. What is the minimal sample size which is required in order to learn this in a good enough manner and where this learning can go wrong. **Ne3 => E=mc2**...
101
101
movies
eye 101
favorite 0
comment 0
100
100
movies
eye 100
favorite 0
comment 0
100
100
movies
eye 100
favorite 0
comment 0
92
92
movies
eye 92
favorite 0
comment 0
Infrastructure/Scaling is a topic really close to me, I'd like to have the chance to talk about how we set this up in the company I work for. Our infrastructure is around 10-15 servers, provisioned on different cloud providers, so a good size infrastructure. Presentation is going to be divided in 3 parts, first part is going to be focused on comparing sysadmin and devops, then there will be an introduction to the basic concepts of Ansible. I want to spend most on the last part, which is going...
90
90
movies
eye 90
favorite 0
comment 0
88
88
movies
eye 88
favorite 0
comment 0
43% of the world's population live on less than €1.5 per day. The United Nations defines poverty as a "lack of basic capacity to participate effectively in society". While we often think of the poor as lacking primarily food and shelter, the UN definition highlights their isolation. They have the least access to society's knowledge and services and the most difficulty making themselves and their needs heard in our democracies. While smart phones and an exploding ability to collect...
87
87
movies
eye 87
favorite 0
comment 0
86
86
movies
eye 86
favorite 0
comment 0
This talk will give a basic introduction to the pySPACE framework and its current applications. pySPACE (Signal Processing And Classification Environment) is a modular software for the processing of large data streams that has been specifically designed to enable distributed execution and empirical evaluation of signal processing chains. Various signal processing algorithms (so called nodes) are available within the software, from finite impulse response filters over data-dependent spatial...
Topics: python, pydata, pydataberlin2014
83
83
movies
eye 83
favorite 0
comment 0
77
77
movies
eye 77
favorite 0
comment 0
76
76
movies
eye 76
favorite 0
comment 0
76
76
movies
eye 76
favorite 0
comment 0
73
73
movies
eye 73
favorite 0
comment 0
73
73
movies
eye 73
favorite 0
comment 0
73
73
movies
eye 73
favorite 0
comment 0
Computers have traditionally been thought as tools for performing computations with numbers. Of course, its name in English has a lot to do with this conception, but in other languages, like the french 'ordinateur' (which express concepts more like sorting or classifying), one can clearly see the other side of the coin: computers can also be used to extract (usually new) information from data. Storage, reduction, classification, selection, sorting, grouping, among others, are typical operations...
Topics: python, pydata, pydataberlin2014
73
73
movies
eye 73
favorite 0
comment 0
This talk will be a practical introduction to MkDocs, a new tool for creating documentation from Markdown: * The background behind MkDocs and the motivation for creating a new documentation tool. * Comparing against Sphinx - what benefits each tool provides. * Getting starting with MkDocs - how to write, theme and publish your documentation. * Under the covers - how MkDocs works, and some asides on a couple of the neat Python libraries that it uses.
71
71
movies
eye 71
favorite 0
comment 0
This talk will be about my latest project in mall analytics, where we estimated visitor trends in malls around the globe using telco data as a basis, and employed map reduce technologies and data science to extrapolate from this basis to reality and correct for biases. We succeeded in extracting valuable information such as count of visitors per hour, demographics breakdown, competitor analysis and popularity of the mall among different parts of the surrounding areas, all the while preserving...
Topics: python, pydata, pydataberlin2014
70
70
movies
eye 70
favorite 0
comment 0
69
69
movies
eye 69
favorite 0
comment 0
## Who I am and motivation I work with hosted Elasticsearch and have interacted with lots of developers. We see what many struggle with. Some relevant theory helps a lot. What follows has already lead to many "Aha!"-moments and developers piecing things together herself. ## The inverted index The most important index structure is actually very simple. It is essentially a sorted dictionary of terms, with a list of postings per term. We show three simple sample documents and the...
68
68
movies
eye 68
favorite 0
comment 0
67
67
movies
eye 67
favorite 0
comment 0
When you think about Python+testing, you usually think about testing your code - unittests, mostly. But it is not the only case! When you have a big system, you need to test it on much higher level - if only to check if all the components are wired in the right way. You may do it manually, but it is tedious and time-consuming - so you want to automate it. And here comes Python - the language of choice in many QA departments. I will tell about differences between unittesting and system testing...
66
66
movies
eye 66
favorite 0
comment 0
65
65
movies
eye 65
favorite 0
comment 0
65
65
movies
eye 65
favorite 0
comment 0
Python has had a long history in Scientific Computing which means it has had the fundamental building blocks necessary for doing Data Analysis for many years. As a result, Python has long played a role in scientific problems with the largest data sets. Lately, it has also grown in traction as a tool for doing rapid Data Analysis. As a result, Python is the center of an emerging trend that is unifying traditional High Performance Computing with "Big Data" applications. In this talk I...
64
64
movies
eye 64
favorite 0
comment 0
Python for Zombies is the first MOOC (Massive Open Online Course) to teach programming in portuguese. Our first edition had 15.000 "zombies". This course is very different from traditional MOOCs (Edx, Coursera, etc). First is an initiative of the brazilian Python community. Besides the difference of language, it is very "casual", as I have little time, I recorded most of the videos on my trips between meetings, conferences or at mealtimes. This MOOC is "brazilian",...
63
63
movies
eye 63
favorite 0
comment 0
The talk will give an overview on how to add centralized, structured logging to a python application running on multiple servers. It will focus on useful patterns and show the benefits from structured logging.
61
61
movies
eye 61
favorite 0
comment 0
59
59
movies
eye 59
favorite 0
comment 0
Programming isn't just about software architectures and object-oriented design; it is also about solving algorithmic problems *efficiently*, some of which are really *hard* [[Hetland, 2010]][0]. The way we decide to *represent* and to *solve* our problems (i.e., the *data structure* and the *algorithm* we use, respectively) has a great impact on the overall *complexity* of our solution. In this scenario, **graphs** define a powerful mental (and mathematical) model to deal with many algorithmic...
58
58
movies
eye 58
favorite 0
comment 0
This talk is about several approaches to implement high performing numerical algorithms and applications in Python. It introduces into approaches like multi-threading, parallelization (CPU/GPU), dynamic compiling, high throughput IO operations. The approach is a practical one in that every approach is illustrated by specific Python examples. The talk uses, among others, the following libraries: * NumPy * numexpr * IPython.Parallel * Numba * NumbaPro * PyTables
57
57
movies
eye 57
favorite 0
comment 0
56
56
movies
eye 56
favorite 0
comment 0
Presentation describes how to implement debugger for Python and has 4 parts: * Tracing Python code Explains how to use trace function * Debugger Architecture Explains which parts consists of a modern full-fledged debugger. * A Bit of Details Explains how to make code to work for all python versions and implementations, survive gevent monkey-patching etc. * Cool Features Explains how to implement exception handling and multiprocess debugging
56
56
movies
eye 56
favorite 0
comment 0
54
54
movies
eye 54
favorite 0
comment 0
54
54
movies
eye 54
favorite 0
comment 0
Conda is an open source package manager, which can be used to manage binary packages and virtual environments on any platform. It is the package manager of the Anaconda Python distribution, although it can be used independently of Anaconda. We will look at how conda solves many of the problems that have plagued Python packaging in the past, followed by a demonstration of its features. We will look at the issues that have plagued packaging in the Python ecosystem in the past, and discuss how...
Topics: python, pydata, pydataberlin2014
54
54
movies
eye 54
favorite 0
comment 0
53
53
movies
eye 53
favorite 0
comment 0
53
53
movies
eye 53
favorite 0
comment 1
favoritefavoritefavoritefavorite ( 1 reviews )